Fitting Logistie IRT Modeis: Small Wonder
نویسنده
چکیده
Smte-oi-rbe-art ítem response theo,y (IRT) models use logistie functions exclusivcly as their ítem response functions (IREs). Logistie funcííons mee Ihe reqrtírements thai iheir range is che unit iníerval ami íhat they are monotonically increasing. bur they upase a parameter ipace whose dimensions can only be assigned a meíaphoñcal interpretation in the context of cesting. Applications of 1RT models require obtaining ihe set of values itt Ingístie functíon paranieters thai besí fu an empirícal dato set. l-lowever. success in ohtaininsi such set of values does ncc guarantee thai che constructs they represent actually exisí, for the adequacy of a model is nol sustained hy ihe possihility of estimaring parameters. Thís article illustrates bow meehanical adoplion of off-theshelf logistic lunc[ions as IRFs lbr IRT modeis can result in off-the-shelf parameter estímates and tUs to dato. Thc results of a simulation study are presenced, which show that logistie 11ff modeis can tít a set of dato generated by IREs other chan logiscic funccions just as well as they ¡it iogisric dato. even though the response processes and parameter spaces involved in each case are subsíantially dírferení. An explanation of why logistic funetions wnrk os lhey do is ofreced, che íhcorecical and practícal consequences of Iheir behavior are discuised, and a testable altemative lo logistic IRFs is commented cpon. Key ~vúnIs.goodnass ej/ii, paran cier esíimwíou, ile»> nxsponse theory, legisle: models, Jiniie siate poIynrníur: ,nodels, BILGC
منابع مشابه
Beyond Majority Voting: Generating Evaluation Scales using Item Response Theory
We introduce Item Response Theory (IRT) from psychometrics as an alternative to majority voting to create an IRT gold standard (GSIRT ). IRT describes characteristics of individual items in GSIRT their difficulty and discriminating power and is able to account for these characteristics in its estimation of human intelligence or ability for an NLP task. In this paper, we evaluated IRT’s model-fi...
متن کاملIRT-FIT: SAS® Macros for Fitting Item Response Theory (IRT) Models
Psychometrics has recently seen the development of complex measurement models to better represent test and item data. Item Response Theory (IRT), in particular, comprises a set of non-linear latent variable models that appear to have several conceptual and empirical properties that make them more valuable in practice than classical test theory methods. However, IRT-based models typically requir...
متن کاملIdentifying DIF for Latent Classes with the Dirichlet Process
of the Dissertation Identifying DIF for Latent Classes with the Dirichlet Process by Miles Satori Chen Doctor of Philosophy in Statistics University of California, Los Angeles, 2015 Professor Peter Bentler, Chair In Item Response Theory (IRT), Differential Item Functioning (DIF) occurs when individuals who have the same ability, but belong to different groups, have different probabilities of an...
متن کاملBuilding an Evaluation Scale using Item Response Theory
Evaluation of NLP methods requires testing against a previously vetted gold-standard test set and reporting standard metrics (accuracy/precision/recall/F1). The current assumption is that all items in a given test set are equal with regards to difficulty and discriminating power. We propose Item Response Theory (IRT) from psychometrics as an alternative means for gold-standard test-set generati...
متن کاملAssessing IRT Model-Data Fit for Mixed Format Tests
This study examined various model combinations and calibration procedures for mixed format tests under different item response theory (IRT) models and calibration methods. Using real data sets that consist of both dichotomous and polytomous items, nine possibly applicable IRT model mixtures and two calibration procedures were compared based on traditional and alternative goodnessof-fit statisti...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2003